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Protein-protein interactions are of utmost importance to an 
understanding of biological phenomena since non-covalent 
and therefore reversible couplings between basic proteins 
leads to the formation of complex regulatory and adaptive 
molecular systems. Such systems are capable of maintaining 
their integrity and respond to external stimuli, processes inti- 
mately related to living organisms. These interactions, howev- 
er, span a wide range of dissociation constants, from sub-nano- 
molar affinities in tight complexes to high-micromolar or even 



millimolar affinities in weak, transiently formed protein com- 
plexes. Herein, we demonstrate how novel NMR and EPR tech- 
niques can be used for the characterization of weak protein- 
protein (ligand) complexes. Applications to intrinsically disor- 
dered proteins and transiently formed protein complexes illus- 
trate the potential of these novel techniques to study hitherto 
unobserved (and unobservable) higher-order structures of pro- 
teins. 



Introduction 

Protein-protein interactions are fundamental chemical events 
in living organisms and thus of prime interest to biochemical 
research in many fields of life science. Although protein X-ray 
crystallography has delivered an enormous amount of highly 
refined structural data of large protein aggregates its applica- 
bility is limited to high-affinity and stably structured protein 
complexes. In the recent past, it has yet become clear that also 
weak protein-protein interactions leading to transiently 
formed complexes play key roles in diverse biological process- 
es, such as cell signaling, post-translational modifications, gen- 
eral transport phenomena, cellular trafficking, enzyme catalysis, 
and transcription as well as translational regulation. Additional- 
ly, there is growing awareness of the importance of intrinsically 
disordered proteins (IDPs) in eukaryotic life. Their central role 
in protein interaction networks has been established. IDPs effi- 
ciently sample a vast and heterogeneous conformational space 
allowing them to interact with multiple and diverse binding 
partners. It is therefore a great and rewarding challenge for 
modern structural biology research to access these proteins in 
the completeness of their native states and to adequately de- 



[a] T. Sara, T. C. Schwarz, Dr. D. Kurzbach, Prof. R. Konrat 
Department of Structural and Computational Biology 
Max F. Perutz Laboratories 

Vienna Biocenter Campus 5, 1030 Vienna (Austria) 
E-mail : robert.konrat@univie.ac.at 

[b] C. H. Wunderlich, Prof. C. Kreutz 

Institute of Organic Chemistry and CMBI, University of Innsbruck 
Innrain 80/82, 6020 Innsbruck (Austria) 
[**] This article is part of the Virtual Special Issue "Structure Characterization of 
Biomolecules" 

© 2014 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. 
This is an open access article under the terms of the Creative Commons 
Attribution License, which permits use, distribution and reproduction in 
any medium, provided the original work is properly cited. 



scribe their inherent structural plasticity. In this context it is of 
interest that sparsely populated protein conformational states 
(high-energy, excited states) are essential for the functionality 
of all kinds of proteins — ranging from IDPs to tightly struc- 
tured enzymes. [1_3] It is thus mandated that the conventional 
notion "protein structure determines function" needs to be re- 
interpreted (up to now predominantly ground state structures 
were considered). 

Evidently disordered systems are not amenable to X-ray crys- 
tallography, which is by far the most widespread structural 
biology technique. Instead these systems require alternative 
technological strategies. Nuclear magnetic resonance (NMR) 
spectroscopy offers unique possibilities for this quest. Because 
NMR is performed in solution there are different sample re- 
quirements. As such, it can also be applied to molecular sys- 
tems displaying substantial degrees of conformational flexibili- 
ty. Over the years, NMR methodologies were developed to in- 
vestigate protein complexes. NMR chemical shift changes and 
saturation techniques are used to locate protein interaction 
sites. Residual dipolar couplings (RDC) [4,5] together with nucle- 
ar-Overh a user-effect (NOE)-based methods [6] and paramagnetic 
relaxation enhancement (PRE) [7] can be used to evaluate bind- 
ing interfaces with residue resolution, even in cases of weak af- 
finities and transient intermolecular contacts. It should be 
noted that NMR not only allows for locating interaction sites 
and provides structural information about the binding inter- 
face, but also allows for the quantification of binding affinities 
(dissociation constants, K D ). For more details and broader cov- 
erage of NMR applications, the reader is referred to excellent 
reviews published over the years. [6,8,9] Here we provide an illus- 
tration of the power of NMR spectroscopy for the elucidation 
of reversible protein interaction events by means of so-called 
relaxation dispersion (Car-Purcell-Meiboom-Gill, CPMG) ex- 
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periments. Applications to studies of protein-protein interac- 
tion are illustrated with a quantitative analysis of Max homodi- 
merization. It is shown that NMR provides detailed information 
about this dynamic protein dimerization process. Both kinetic 
and structural parameters are obtained, which permits the 
analysis of the only marginally populated protein complex. An 
example for studies of enzyme catalysis is given through the 
observation of reversible binding of the cold-shock protein 
CspA to its cognate cold-box RNA. Here, dynamics in unpertur- 
bed equilibrium of both, the protein and RNA components, 
can be studied via NMR spectroscopy. Again, we anticipate 
that these NMR measurements will provide unprecedented in- 
sight into the molecular details and elementary steps of RNA 
chaperone function. Finally, we present a detailed description 
of how NMR and electron paramagnetic resonance (EPR) spec- 
troscopy can be used in a complementary fashion to investi- 
gate complex formation of IDPs. Most importantly, this novel 
approach revealed that upon ligand binding the model IDP Os- 
teopontin largely remains disordered although its conforma- 
tional ensemble is updated. Ligand binding proceeds through 
conformational selection of a preformed conformational sub- 
state that is characterized by an unexpected cooperative stabi- 
lization as common for stably folded globular proteins. The ob- 
served structural and dynamical compensations upon binding 
reveal how thermodynamically unfavorable entropic penalties 
are compensated through partial unfolding of peptide seg- 
ments remote to the primary binding site. 

Results and Discussion 

Access to sparsely populated conformational states by re- 
laxation dispersion NMR experiments 

Biomolecules are characterized by rugged energy surfaces re- 
sulting from attractive and repulsive forces of similar strength. 
The accessible conformational space governs thermodynamics 
and kinetics of conformational transitions between different 
substates. In the past, it has become increasingly evident that 
sparsely populated, high-energy conformational states of a pro- 
tein can be essential for protein function. [3,10] A prominent ex- 
ample is the frequently observed conformational selection 
mechanism for substrate recognition^ 11,121 In this context a sub- 
strate is assumed to interact with a (sparsely) populated high- 
energy (excited) conformation of a protein and to modulate 
the system's conformational ensemble by stabilizing this par- 
ticular high-energy state. However, excited conformational 
states are difficult to access by means of conventional (direct 
detection) NMR, since fast conformational sampling leads to 
averaging of resonances. Averaging "hides" a sparsely populat- 
ed state behind predominating low-energy states. Consider 
a conformational transition process in which a highly populat- 
ed (low-energy) ground state exchanges with a lowly populat- 
ed (high-energy) excited state. The resulting NMR signal is 
a population weighted average of the individual signals stem- 
ming from the two states and dominated by the abundant 
ground-state signal. Detailed information about the sparsely 
populated excited state can nevertheless be obtained by the 



so-called Car-Purcell-Meiboom-Gill (CPMG) relaxation disper- 
sion experiment^ 13 ' 141 This experiment takes advantage of the 
fact that the conformational transitions between a highly (A) 
and a lowly (B) populated species affects the transverse relaxa- 
tion times and thus line widths and intensities of the ground 
states' (A) NMR signals. Suppose a 90° pulse that flips the mag- 
netization of an amide nitrogen of a particular amino acid 
from the quantification axes to the transverse plane. The spin 
will immediately start to precess and concomitantly dephase 
during a given time interval r. Conformational transitions be- 
tween ground and excited state lead to alterations of the pre- 
cession frequencies (Figure 1a, top). Application of a 180° in- 



a) b) 




Figure 1. a) Phase of two spins with resonance frequencies co A ( ) and co B 

( ) in dependence of pulse sequence evolution time and pulse interlay in 

a CPMG experiment. Corresponding detectable NMR signals on the right. 
With increasing v c ?mg> the signal becomes sharper, b) Theoretically calculated 
dependence of /? 2eff on v CPMG . 

version pulse after the time interval r leads to partial refocus- 
ing of the precessing magnetization and hence increased 
signal intensities after the interval 2r (Figure 1 a, middle). In- 
creasing the number of 180° inversion pulses improves the re- 
focusing efficiency in the CPMG experiment and results in in- 
creased signal intensities (Figure 1 a, bottom). Depending on 
the overall exchange rate between the two states A and B, k = 
^a + ^b/ (with k A and k B being the forward and backward rates) 
the number of refocusing pulses will alter the observed signal 
intensity. If the exchange rate k is slower than 1/r, the precess- 
ing magnetization will be completely refocused and signal in- 
tensity is unaffected by the exchange process and hence maxi- 
mized. Yet, if k is in the range of or larger than 1/r the ob- 
served signal intensity will be modified and depend on the fre- 
quency (^cpivig) of 180° inversion (refocusing) pulses. In other 
words, the effective (detected) transverse relaxation rate, R 2 ^ 
will be affected by the exchange process and the time interval 
r (or the number of 180° inversion pulses, see Figure 1). In- 
creasing the frequency of 180° inversion pulses, where f C PMG = 
1/(4r), in the CPMG train (Figure 1 a) will lead to progressive 
quenching of the exchange process. This dependence is ex- 
ploited by CPMG pulse sequences in which pulse trains with 
repeating r-180° x - T blocks are applied to refocus the magneti- 
zation. Sampling different values of v CPMG yields a dependence 
°f ^2,eff versus v CPMG . /^eff^cPMc) w iH decrease with increasing 
v CPMG values (Figure 1 b). In the case of complete quenching of 
the exchange process, however, one observes a flat profile 
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with /? 2 ,eff— ^2,0 (^2,0 ' s tne exchange-free transverse relaxation 
rate). Fitting /? 2 , e ff(^cPMG) as a function of f C PMG allows to deter- 
mine the exchange rate k between the two states. Further- 
more, the chemical shift difference between A and B and the 
populations of these states can be extracted through measure- 
ments at different magnetic field strengths. [15] Depending on 
the time scales, different approaches have to be taken into ac- 
count for quantitative analysis of fast, slow and intermediate 
exchange regimes. [16] 

The CPMG experiment can also be explained from a kinetic 
point of view by taking into account the deleterious effects of 
randomly changing precession frequencies on echo formation. 
The observed NMR signal, that is, a spin echo, will become 
weaker the more the precision frequencies of the individual 
spins in an ensemble differ from each other. The time needed 
for the spins to be randomly distributed in the transverse 
plane after RF excitation is inversely proportional to the spec- 
trum of precision frequencies (line width). Conformational ex- 
change modulation of precision frequencies due to changes in 
the local environment of some spins leads to additional line 
broadening and concomitant signal attenuation. Taking a look 
at the CPMG pulse train in Figure 1 a one can see that this 
effect will only be effective if the time between two pulses is 
longer than the spin needs (on average) to change its preces- 
sion frequency due to conformational changes in the system. 
In other words, the frequency of refocusing pulses (v C pmg) must 
be shorter than the exchange rate k in order to observe en- 
hanced signal attenuation. Thus, by varying v cmG and observ- 
ing the consequent effect on the signal intensity one can yield 
information about k and the details of conformational ex- 
change processes (frequency difference between individual 
states). 

In the realm of protein NMR, one frequently observes the re- 
laxation dispersion of backbone ^N-'H amide resonances. Im- 
plementation of the CPMG train into a two dimensional data 
acquisition yields residue-dependent /? 2;eff values. In this case, 
a residue's exchange frequency might be affected by local con- 
formational interconversion processes, for example, flapping of 
aromatic side chains, [17] and by global processes like large scale 
conformational adaptations due to, for example, ligand bind- 
ing. In order to separate global from local influences, one can 
choose to fit the dispersion profiles in a global or residue-spe- 
cific manner. Numerous applica- 
tions have demonstrated the 
wide applicability of this ap- 
proach and provided valuable 
insight into the atomic details of 
relevant dynamic processes in 
biology. [18] Here we illustrate the 
technique with applications to 
a transiently formed chaperone- 
RNA complex and the reversible 
formation of a homodimeric 
coiled-coil protein complex. 

As a case study of lowly popu- 
lated state of ligand-bound pro- 
tein, we present here the inter- 




Figure 2. Structural model of CspA with surface hydrophobic residues 
shown in magenta; the primary RNA binding site is represented in green 
(residues 26-41). Structure PDB ID: 2L15. [19] 



action of the Escherichia coli cold-shock protein CspA (the 3D 
structure of the apo-state of CspA is given in Figure 2) with 
a fragment of its mRNA. The CspA is a 70 amino acid 7.4 kDa 
protein naturally occurring in cold-shocked bacteria. It func- 
tions as an RNA chaperone by interacting with the 5'-untrans- 
lated region (UTR) of its own mRNA (called the cold-box RNA, 
CB RNA); [20] overproduction of the CspA mRNA causes dere- 
pression of the cold-shock genes expression. The CspA struc- 
ture comprises a p-barrel fold. The RNA binding surface is pri- 
marily constituted of solvent-exposed aromatic residues, 
mostly phenylalanines^ 20,211 However, the RNA-bound state of 
the protein is difficult to assess by means of directly detected 
NMR, since only a minor fraction of the present RNA molecules 
interact with CspA at any given time, and the time frame of 
the interaction is very short. Saturating the population of the 
bound state is unfeasible due to the progressive signal loss 
with increasing RNA concentration as a consequence of signifi- 
cant line broadening, especially in the binding sites. [21] Most 
other techniques for measuring protein-ligand interaction are 
not applicable since the population of the bound-state protein 
is usually below their limit of detection. 

Thus, the CPMG relaxation dispersion (RD) experiment is 
a promising technique to supply information about the sparse- 
ly populated RNA-bound state of CspA. Here, we provide 
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Figure 3. CPMG relaxation dispersion profiles for CspA residues a) F12 and b) F18. Relaxation profiles of the CspA 
in the presence of 10% ACB (A and ■) and in the absence of RNA (#). Black squares (■) show the relaxation 
profile at 800 MHz NMR field frequency and red triangles (A) show the profile measured at the frequency of 
500 MHz; solid lines indicate "global" data fits. 
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15 N-CPMG data of residues F12 and F18 (Figure 3) to illustrate 
how this technique can be used to analyze the transiently 
formed CspA-RNA complex. The employed RNA interaction 
partner is the so-called anti-cold-box (ACB) RNA, which is par- 
tially complementary to the cold-box RNA. 

For CspA residues F12 and F18, 15 N-dispersion relaxation 
profiles were measured in the absence and in the presence of 
ACB RNA (Figure 3). In the absence of RNA, no exchange pro- 
cess can be detected as reflected in the flat dispersion relaxa- 
tion profiles. However, in the presence of 10% molar ratio of 
ACB RNA, distinct changes in relaxation profiles were observed. 
Global fitting based on an intermediate exchange model, [16] 
yielded 1.6% of the conformational ensemble to comprise the 
CspA RNA-bound state; the exchange frequency was deter- 
mined to be 1.4 kHz. The participation of the phenylalanine 
residues F12 and F18 (highlighted in Figure 2) adjacent to the 
primary RNA binding site during the binding event was previ- 
ously described. [21] It is noteworthy that the relaxation disper- 
sion profiles not necessarily reflect the kinetics of the RNA 
binding event, but the frequency of the structural rearrange- 
ments in the region comprising residues F12 and F18. Howev- 
er, it has been shown that the RNA binding triggers structural 
adaption of the protein. Hence, it is reasonable to correlate the 
exchange frequency (1.4 kHz) fitted globally to F12 and F18 
with the effective time constant of the RNA binding/release. 
Referring to the structural model shown in Figure 2, the F12 
and F18 residues are positioned sideways to the primary bind- 
ing site and can therefore establish stacking interactions with 
the ribose moieties in the bound RNA strand. Other residues 
not present in either the primary or the secondary binding site 
do not exhibit changes in /? 2eff as observed for F12 and F18 
(data not shown). Thus, CPMG analysis allows for determina- 
tion of kinetics of a state as lowly populated as 1.6%, which 
would be difficult to assess by other techniques. Further, the 
method used here enables one to obtain information not only 
on kinetics and population, but also yields the nitrogen chemi- 
cal shift differences between the free and the bound state of 
residues observed by CPMG data that correlate with the mag- 
nitude of changes in the chemical environment. Thus, in princi- 
ple one could extract structural information about the bound 
state of CspA by means of CPMG through the chemical shift 
difference to the apo-state if a sufficiently high number of resi- 
dues yield exchange-affected dispersion profiles. The CspA 
case study presented here demonstrates the power and versa- 
tility of the CPMG method in assessing the lowly populated 
states important in RNA-protein interactions. 

To complement the results obtained from CspA-derived 
CPMG measurements, we also focused on the investigation of 
micro- to millisecond dynamic processes of the CB and ACB 
RNAs, which interact with CspA. In contrast to protein NMR 
spectroscopy, where the backbone amide nitrogen proton spin 
pairs represent suitable reporter nuclei for CPMG RD experi- 
ments, for nucleic acids, 13 C-CMPG RD experiments are the 
method of choice. [22] This is due to the fast exchange rates of 
the imino protons, that is, nitrogen-bound protons (H3 in uri- 
dine and H1 in guanosine) directly involved in base pairing, 
with bulk water leading to significant line-broadening effects 



or even to a total collapse of the imino proton signal within 
the dominant water resonance. But as in protein NMR spec- 
troscopy, uniform 13 C/ 15 N-labeling patterns impair the applica- 
tion of certain NMR experiments to address dynamic features, 
like the CPMG RD experiment. A uniform 13 C/ 15 N-labeling pat- 
tern introduces one-bond scalar and dipolar 13 C- 13 C couplings 
resulting in resolution and sensitivity problems especially for 
larger RNA comprising more than 30 nucleotides (nt) but also 
to artifacts in the CPMG RD data sets impairing a reliable analy- 
sis. [23] To circumvent several issues arising from the state-of- 
the-art uniform labeling protocol using 13 C/ 15 N-modified ribo- 
nucleotide triphosphates (rNTPs) and T7 RNA polymerase, sev- 
eral approaches were proposed. [24] In one of our laboratories, 
we have recently put focus on isotope labeling RNAs via the 
solid-phase synthesis approach. For example, a minimally inva- 
sive 19 F-based isotope labeling protocol can be very efficiently 
applied to RNA as all four 2 / -fluorine modified nucleotide phos- 
phoramidites are commercially available and can be incorpo- 
rated into the target RNA without modifying the standard pro- 
tocol (Figure 4a). The 19 F nucleus has favorable NMR spectro- 
scopic features, like 100% natural abundance, an intrinsic NMR 
sensitivity almost as high as protons (83%) and a vast chemical 
shift range about 100 times larger than protons. It represents 
a bioorthogonal reporter spin for nucleic acids and was effi- 
ciently used in RNA folding and RNA ligand binding NMR 
assays. [25_27] We currently introduce 2 / -fluorine labels into the 
CB and ACB RNAs to study the interaction of both RNAs with 
CspA but also to address conformational dynamics via 
19 F-CPMG relaxation dispersion experiments (data not shown). 
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Figure 4. a) Stable isotope labeling patterns that can be introduced via the 
solid-phase RNA synthesis approach. 2'-Fluorine and 6- 13 C-pyrimidine labels 
are shown, b) Cold-box RNA with five 6- 13 C-uridine labels and the 1H- 13 C- 
HSQC spectrum with assignments, c) Selected 13 C-CPMG relaxation disper- 
sion profiles at 125 MHz carbon larmor frequency of the cold-box RNA. Resi- 
due U6 shows a non-flat dispersion profile indicating microsecond to milli- 
second dynamics, whereas U22 displays no dynamics on that timescale. 
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We also focused on the chemical synthesis of site-specifically 
13 C-modified pyrimidine phosphoramidites (Figure 4 a). [28] This 
labeling protocol results in functionally unperturbed RNAs at 
the costs of the labor-intensive production of the 6- 13 C-uridine 
and -cytidine phosphoramidites. For example, all uridines in 
the aforementioned stem-loop motif (CB RNA, Figure 4b) resid- 
ing in the 5'-UTR of the E. coll cspA messenger RNA (mRNA) 
were replaced by the 6- 13 C-modified counterparts. We then 
used 13 C-CPMG relaxation dispersion experiments to address 
the micro- to millisecond dynamics of this very RNA. Prelimina- 
ry results are shown (Figure 4c) and hint towards a dynamic 
hotspot of the cold-box RNA in the vicinity of the G»U wobble 
base pair. A quantitative description of this dynamic phenom- 
enon and the possible biological relevance for the interaction 
with CspA are currently investigated in our research groups. 

As a third example for the application of CPMG techniques 
to transient protein interaction studies, we outline an applica- 
tion to the reversible protein dimerization of the Myc-associat- 
ed factor X (MAX). [29] MAX readily forms heterodimers with its 
authentic binding partner Myc yielding an active transcription 
factor complex. Deregulation of this tightly controlled protein 
complex formation can lead to severe disease phenotypes. [29] 
Under neutral pH conditions and room temperature, MAX na- 
tively populates predominantly a dimeric, coiled-coil a-helical 
motif (see Figure 5 a). [30] Changing solution conditions such as 
pH and temperature, however, allows for controlled manipula- 
tion of the monomer-dimer equilibrium. [30] In Figure 5 b, a 
15 N- 1 H HSQC of MAX is shown under conditions that favor the 
monomeric form (pH 5.5, 35°C). [30] The strong spectral overlap 



indicates that the prevailing monomeric form of MAX is to 
a large degree conformationally disordered. In Figure 5 c,d we 
show CPMG profiles of two residues, Ser32 and Glu83, of MAX 
under these conditions at 500 and 800 MHz NMR static field 
strength. The observed exchange process corresponds to inter- 
conversion of monomer and dimer MAX states. [30] Fitting the 
data globally yields a population of the MAX dimer of 0.75% 
with an exchange rate of 0.4 KHz. [16] This is in excellent agree- 
ment with data obtained through a thermodynamic analysis of 
the MAX monomer-dimer equilibrium^ 301 

The reliability of the CPMG results can be further tested by 
comparison of the chemical shifts extracted from the CPMG 
data with literature data for the dimeric state of MAX. From 
earlier publications, for example, the co N of the bound (dimeric) 
state of MAX is known, while the chemical shift of the mono- 
meric form can be directly observed in ^N^H HSQC spectra 
(since the monomeric form of MAX is the dominating species 
under these conditions). The differences between monomeric 
and dimeric chemical shifts should coincide with individual 
dco's obtained from a fit of CPMG data. As indicated in Fig- 
ure 5b, for Ser32 excellent agreement between fitted and ex- 
perimental chemical shift differences were found (dco fit = 
8.0 ppm and w N Dimer -w N Monomer = 7.8 ppm). [29] Because the 
chemical shift change depends on molecular rearrangements 
(e.g., through changes in conformation, local hydration, hydro- 
gen bonds etc.), unique information can be obtained about 
the atomic details of sparsely populated protein states even 
under conditions where other experimental techniques fail as 
a consequence of the low population. 
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Figure 5. a) Sketch of the dimerization equilibrium of MAX. b) 15 N- 1 H HSQC of the monomeric state of MAX. The 
red dot indicates 15 N and 1 H frequencies for residue S32 in the homodimeric coiled-coil state (taken from ref. [29]). 
c,d) CPMG relaxation dispersion profiles at 500 (red) and 800 (black) MHz. c) Residue Ser32 d) Residue Glu83. The 
solid lines represent fits of the data with an intermediate exchange model. 



Assessing IDP substates by 
means of combined EPR and 
NMR 

The hallmark of intrinsically dis- 
ordered proteins (IDPs) is their 
vast and heterogeneous confor- 
mational ensemble comprising 
both extended conformations 
and simultaneously compact rea- 
sonably stable structures display- 
ing significant resistance against 
denaturing agents (like urea). [31] 
The rugged energy surface typi- 
cally found for IDPs allows for 
conformational transitions be- 
tween different substates and 
endows IDPs to transiently inter- 
act with a multitude of binding 
partners via, for example, confor- 
mation selection type processes. 
Despite their enormous biologi- 
cal relevance, the structural and 
dynamical characterization of 
this important protein family is 
still far from routine. Most im- 
portantly, the existence of 



© 2014 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim 



ChemistryOpen 201 4, 3, 1 1 5 - 1 23 119 



ChemistryOPEN 

Full Papers 



www.chemistryopen.org 



sparsely populated substates in the conformational ensemble 
requires highly sensitive experimental techniques to probe 
their structural features and encounters with protein binding 
partners. It was recently demonstrated that electron paramag- 
netic resonance (EPR) spectroscopy is a very powerful experi- 
mental tool to access sparsely populated states and transient 
protein interactions. [32_35] Furthermore, EPR spectroscopy is 
a promising candidate to supplement NMR data in terms of 
time and length scales. Some pilot studies of EPR on IDPs have 
already been published/ 32 " 351 yet the strength of EPR, that is, 
the detection of through-space dipolar interaction of electron 
spins by double electron-electron resonance (DEER) spectros- 
copy, [36_39] has turned out to be difficult to apply to IDPs. The 
commonly applied analysis techniques for DEER raw data fail 
in the case of IDP conformational ensembles. [40] Therefore, we 
have amended the repertoire of analysis methods for DEER 
data with an approach that is suitable to interpret data ob- 
tained for IDPs comprising large and heterogeneous conforma- 
tional ensembles. In the following section, we will briefly intro- 
duce the standard DEER methodology and explain our expan- 
sion of this method to IDPs. Our investigations based on this 
novel approach are accompanied by complementary NMR ex- 
periments. Generally speaking, we show how NMR and EPR 
data supplement each other and provide a comprehensive pic- 
ture of IDPs' structural dynamics. 

In order to perform DEER, one has to modify a protein with 
two labels that both carry an unpaired electron, hence, the 
term spin label. [41] The free electrons of the two labels will ex- 
perience a through space dipolar coupling. However, since at 
room temperature dipolar couplings between two labels in so- 
lution average to zero, the sample has to be freeze-quenched. 
As such, one measures the intramolecular dipolar couplings 
between every pair of spin labels "locked in" at the conforma- 
tion at the glass transition temperature of a sample (after cor- 
recting for intermolecular background contributions). In order 
to prolong relaxation times, DEER is, yet, measured at even 
lower temperatures, typically between 20 and 50 K. The dipolar 
couplings manifest themselves as modulation of DEER time 
domain data, that is, the signal intensity as a function of the 
pulse sequence evolution time. [31,38,42] For a single, fixed dis- 
tance, one would get a single cosine modulation of the time 
trace. Yet, as the conformational ensemble grows, the number 
of populated substates increases, and each substate contrib- 
utes to the DEER time trace. The frequency of the cosine mod- 
ulation is dependent on the interspin distance. For the 
common DEER analysis methods, the individual contributions 
of the different substates are given primarily by their weighted 
populations (note that average interspin distances can become 
of importance in large disordered systems, too). [31] With in- 
creasing number of populated conformations and correspond- 
ing interspin distances the total, observable modulations will 
become increasingly blurred and will ultimately converge to 
an exponential decay. This circumstance gives rise to particular 
problems in separating intra- and intermolecular contributions 
to the DEER signal for broad distributions of distances. In the 
case of doubly labeled proteins, one typically wants to elimi- 
nate intermolecular contributions. For intrinsically disordered 



systems, this can only be achieved by measuring DEER referen- 
ces on singly spin-labeled proteins, which yields the pure inter- 
molecular contributions/ 411 

In case of folded proteins, after removal of background con- 
tributions, the spin label distance distribution P{R) can be ob- 
tained by well-established procedures such as the Thikonov 
regularization. [41,43] Yet, in the case of IDPs the broad and inho- 
mogeneous distance distributions leads to less significant time 
trace modulations and poor signal-to-noise ratios that impairs 
the extraction of feasible distance distributions (e.g., like the 
traces shown in Figure 6). [40] Thus, we proposed an alternative 
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Figure 6. DEER time traces of a double mutant comprising the central OPN 
segment from residue C108 to C188 at different urea concentrations. 

approach based on a so-called effective modulation depth, 
A effl for data interpretation of IDPs. Details of this novel ap- 
proach were explained elsewhere. [31] In short, A eff denotes the 
signal decay of the time trace, V(t), at a given point of time, f eff . 
A eff can be defined as: A eff = 1 -V(t efi )/V{t=0). For large IDPs, we 
suggest to simply choose the longest experimentally achieva- 
ble DEER evolution time for f eff . A eff is an approximate measure 
of average interspin distance for broad P(/?)s. Yet, the reader 
should be aware that A efi does not linearly depend on the 
population-weighted average distance in the measured ensem- 
ble but is a complex function of several spectroscopic parame- 
ters. [31] To a first approximation, however, A efi decreases with 
increasing interspin distance R for broad distance distributions. 
With A eff as tool in hand, we probed structural preferences of 
Osteopontin (OPN), a cytokine involved in metastasis of several 
kinds of cancer. [44] Typical DEER data obtained on an IDP are 
shown in Figure 6. 

Conformational stabilities, understood as resistance to urea 
unfolding, of several individual structural segments of OPN 
were investigated by recording DEER time traces for different 
spin labeled double mutants in dependence of increasing urea 
concentration. As such, a decrease of A eff with increasing urea 
concentration is representative for unfolding and expansion of 
a doubly spin-labeled protein of interest. In Figure 6 exemplari- 
ly experimental zl eff values are shown as a function of urea 
concentration for a selected double mutant comprising a cen- 
tral segment of the cytokine OPN. The time traces appear as 
exponential decays because of the aforementioned conver- 
gence of cosine functions (one for each conformation in the 
OPN ensemble). With increasing urea concentration A eff de- 
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creases indicating an expansion of the IDP. As published earli- 
er/ 311 the C-terminal part of OPN exhibits an exponential decay 
of A eff with increasing urea concentration. This gives rise to 
a steep slope of the Zl eff function and can be regarded as a de- 
naturation profile of an unstably folded protein segments of 
potentially random-coil-like character. Already for low urea 
concentrations such segments show significant conformational 
expansion (i.e., a decrease in A eff ) in accordance with the idea 
of very low stability of transient or residual structural elements 
in IDPs. For other mutants, one might observe an approximate- 
ly linear decrease of A eff with urea concentration, indicating 
that the OPN segment framed by these mutants is on average 
conformationally more stable than the C-terminal segment. 
Nevertheless, it is still largely unstructured, random coil- or 
(pre)molten globule-like. Strikingly, however, for a mutant of 
OPN with two terminal labels, we observe a sigmoidal devel- 
opment of the zl eff -derived denaturation profiles with urea con- 
centration (see Figure 7). Sigmoidality is a hallmark for cooper- 
ative folding of protein conformations and unexpected for an 
IDP. [45] 
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Figure 7.A eff for an OPN mutant (C54-C247) comprising nearly the whole 
protein as a function of urea concentration. Error bars stem from signal 
noise. 



A sigmoidal denaturation profile is indicative for stably and 
cooperatively folded tertiary structures of OPN, since for low 
urea concentrations of up to 0.75 m the whole protein does 
not expand significantly (as seen in nearly constant A eff values). 
This observation of a cooperatively folded conformation is sur- 
prising since distance distributions between two labeled resi- 
dues of OPN are generally quite broad, as deducible from prior 
studies concerning OPN's conformational space [11] and as re- 
flected in the non-modulated DEER time traces (Figure 6). This 
interesting finding can, however, be understood by concluding 
that the structural ensemble of OPN contains both coopera- 
tively folded and unfolded conformations and that both con- 
tribute to the DEER signals. [31] This deduction is possible here 
only because DEER EPR on freeze-quenched solutions eluci- 
dates the whole set of co-existing conformations; ensemble 
averaged data here would not allow for discerning between 
partial structuring and sampling of compact conformations. 
Most importantly, the existence of structural cooperative transi- 



tions from folded to unfolded states and vice versa as moni- 
tored by EPR in combination with NMR in IDPs calls for a novel 
conceptual view of IDPs that goes beyond the traditional 
binary scheme of order versus disorder. The subtleties of heter- 
ogeneous conformational sampling in IDPs and their putative 
relevance for biological functions have to be adequately ad- 
dressed. 

The applicability of this novel EPR-NMR approach to the ob- 
servation of transient complex formation of IDPs was also re- 
cently demonstrated. Not only can the A eff approach be uti- 
lized to elucidate structural preferences of IDPs complementa- 
ry to NMR data, but also ligand interaction can be observed in 
high detail. We analyzed the interaction of OPN with hepa- 
rin, [11] a highly sulfated glycosaminoglycan widely used as anti- 
coagulant (see Figure 8 a). [10] In a biological context, heparin 
binding to OPN is of interest since it models the OPN-heparan 
sulfate interaction, which constitutes a crucial cofactor in OPN- 
CD44 receptor association, a process involved in cell signaling 
and adhesion. [46] Additionally, interactions between IDPs and 
biological polyelectrolytes are quite common, 131 and our results 
might well be applicable to other systems. 

Upon binding to heparin, OPN largely remains disordered al- 
though its structural ensemble is updated. For several doubly 
spin-labeled mutants of OPN, heparin binding leads to a clear 
decrease in A eff indicating longer distances between the label- 
ing sites of the six double mutants and an expansion of the 
protein upon heparin binding. This is shown in a previous 
paper. [10] This information from EPR agrees well with informa- 
tion concerning the binding event gained from NMR-based 
paramagnetic relaxation enhancements (PREs). [10] When inter- 
preting PREs of IDPs, one should be aware that due to the 
rapid conformational sampling of IDPs one observes ensemble 
averaged PRE data. [47] Hence, all conclusions drawn from these 
refer to "average" conformations and differential values (bound 
minus apo-state PREs), APRE>0 indicate "on average" increas- 
ing distance in the protein complex between labeling site and 
a residue upon binding, APRE<0 the opposite. As can be ob- 
served in Figure 8 b, OPN displays differential changes of long- 
range backbone interactions as heparin binds. The central spin 
label attached to C108 experiences a displacement from the 
core region around residues 130-190, whereas the spin label 
C188 is separated from the region comprising residues 90 and 
120. Information on the binding process and its impact on 
OPN's dynamic behavior can be obtained when comparing the 
15 N NMR relaxation rates and heteronuclear 15 N-H NOEs 
(hetNOE) of bound and free forms. NMR relaxation reports on 
motions ranging from picosecond to low-nanosecond time- 
scales. The three measurable NMR relaxation parameters {R y R 2 
and hetNOE; Figure 8c,d) have different dependencies on the 
time scales of motions. While the 15 N transverse relaxation rate 
R 2 reports on nanosecond motions, the heteronuclear ^N-'H 
NOE depends on the efficiency of magnetization transfer from 
15 N to 1 H in the protein backbone, and which is mostly influ- 
enced by very fast (picosecond time scale) dynamics of the N— 
H bond. Figure 8d shows that upon heparin binding to OPN, 
very fast motions (probed by 15 N-H hetNOE) increase in the 
region located around residue 120. The central region around 
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Figure 8. a) Schematic representation of the OPN — heparin interaction, b) Differential PRE values for the two cen- 
tral mutants (PRE of OPN and heparin; PRE of OPN). The positions of spin labels are indicated (red crosses), c) Ap- 
proximate residue charge of OPN. The color code is identical to the schematic representation of the binding pro- 
cess in Figure 8a. d) Differential transverse relaxation and heteronuclear NOEs (OPN and heparin; OPN). 



residue 150, however, shows increased 15 N transverse relaxa- 
tion rates R 2 due to decreased conformational flexibility in the 
binding cleft. Closer inspection of the charge distribution in 
OPNs primary sequence reveals that the differential mobility 
changes upon heparin binding are linked to the electrostatic 
pattern found in OPN (increased mobility in negatively charged 
regions and decreased mobility/rigidification in positively 
charged patches). 

Through combination of NMR and EPR a clear picture arises 
of the OPN-heparin binding event. As the highly negatively 
charged heparin contacts OPNs positively charged core region, 
the structural and dynamic properties of the OPN ensemble 
get drastically altered. Along with an increase in dynamics 
along most of the proteins backbone, an average increase in 
distances is observed. This unfolding-upon-binding event is de- 
picted in Figure 8a. The interaction of heparin (sticks) with 
OPNs positive core (blue) leads to a displacement of the N-ter- 
minal negatively charged region (red) either by direct repulsion 
through heparin or by loss of contacts to the positive region, 
explaining both the average increase in distances and the dy- 
namic changes. 

From isothermal titration calorimetry (ITC) measurements, 
AH and AS values of -16.3 kCalmol" 1 and -35 calmolK" 1 , re- 
spectively, can be obtained for the heparin binding event of 
OPN assuming an average molecular weight of 17.5 kDa for 
heparin. These quite large AH and AS values nearly cancel 
each other at 293 K in the Gibbs energy (AG = AH-7A5). Thus, 
although there is local rigidification in the heparin binding 
cleft (region around residue 155) the resulting conformational 
entropy penalty is reduced by a compensatory increase in con- 
formational flexibility of the negatively charged region 90-120. 



The local (or segmental) unfold- 
ing and expansion of the OPN 
core segment thereby signifi- 
cantly contributes to the overall 
thermodynamic equilibrium bal- 
anced between counteracting 
contributions like solvation en- 
thalpy, rotational and translation- 
al degrees of freedom and con- 
formational entropy. [48,49] 

Conclusions 

Three examples for weak and re- 
versible protein interactions are 
presented in combination with 
magnetic resonance based ex- 
perimental techniques to access 
these interactions. Relaxation 
dispersion (Car-Purcell-Mei- 
boom-Gill, CPMG) experiments 
allow for determination of 
sparsely populated states in 
CspA-RNA complexation and 
MAX-MAX homodimerization. 
Since both kinetic data about 
complex formation (association and dissociation rates) as well 
as structural details of lowly populated (high-energy) confor- 
mational states are accessible by this technique, unique infor- 
mation about protein interactions becomes amenable. Further, 
the combination of paramagnetic relaxation enhancement 
(PRE) NMR and double electron-electron resonance (DEER) 
electron paramagnetic resonance (EPR) gives rise to the unpre- 
cedented and unexpected observation of cooperatively folded 
conformational states contained in the conformational ensem- 
ble of Osteopontin (OPN) and its changes in the weak OPN- 
heparin interaction. Our findings suggest that more elaborate 
conceptual approaches are required for an adequate descrip- 
tion of intrinsically disordered proteins and their conformation- 
al ensembles. 

Proteins are characterized by significant structural plasticity 
and can undergo large structural rearrangements of the time- 
averaged conformational ensemble. It is therefore evident that 
classical structural biology approaches are only starting points 
for a comprehensive analysis of protein function. For intrinsi- 
cally disordered proteins (IDPs) flat energy landscapes allow 
for rapid exchange between different conformational isomers 
(substates). These often only sparsely populated conformation- 
al states, yet, lead to the formation of protein complexes that 
are essential for biological function. Detailed knowledge of the 
transiently formed intermediates along the reaction trajectory 
will be highly valuable. Further developments of new tech- 
niques but also the amendment of existing ones (as illustrated 
with the novel analysis technique of DEER data) will be neces- 
sary to provide a more complete picture of how proteins form 
biologically active molecular aggregates and perform the 
myriad of essential tasks and how this information can be ex- 
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ploited to manipulate their activities based on knowledge of 
the underlying chemical mechanisms. 

Acknowledgements 

We thank Prof. Dariush Hinderberger for his collaboration. D.K. 
acknowledges a Feodor Lynen scholarship by the Alexander von 
Humboldt-Foundation (Germany). This work was supported by 
the Austrian Science Foundation (FWF) (1844 to CK. and R.K.). 

Keywords: electron paramagnetic resonance • intrinsically 
disordered proteins • nuclear magnetic resonance • protein- 
protein interactions • transient complexes • structure 
characterization of biomolecules 

[1] S. Kumar, B. Y. Ma, C.J. Tsai, N. Sinha, R. Nussinov, Protein Sci. 2000, 9, 
10. 

[2] J. P. Loria, R. B. Berlow, E. D. Watt, Acc. Chem. Res. 2008, 47, 214. 

[3] D. D. Boehr, R. Nussinov, P. E. Wright, Nat. Chem. Biol. 2009, 5, 789. 

[4] J. R. Tolman, K. Ruan, Chem. Rev. 2006, 706, 1720. 

[5] G. Bouvignies, P. Markwick, R. Bruscheweiler, M. Blackledge, J. Am. 

Chem. Soc. 2006, 728, 15100. 
[6] G. S. Rule, T. K. Hitchens, Fundamentals of Protein NMR Spectroscopy, 

Springer, Dordrecht, 2006. 
[7] G. M. Clore, J. Iwahara, Chem. Rev. 2009, 709, 4108. 
[8] H. J. Dyson, P. E. Wright, Chem. Rev. 2004, 704, 3607. 
[9] R. Ishima, D. A. Torchia, Nat. Struct. Biol. 2000, 7, 740. 
[10] D. Kurzbach, T. C. Schwarz, S. Hofler, D. Hinderberger, R. Konrat, Angew. 

Chem. 2014, 726, 3919; Angew. Chem. Int. Ed. 2014, 53, 3840. 
[11] G. Platzer, A. Schedbauer, A. Chemelli, P. Ozdowy, N. Coudevylle, R. 

Auer, G. Kontaxis, M. Hartl, A. J. Miles, B. A. Wallace, O. Glaser, K. Bister, 

R. Konrat, Biochemistry 2011, 50, 6113. 
[12] C.J. Tsai, B.Y. Ma, Y.Y. Sham, S. Kumar, R. Nussinov, Proteins 2001, 44, 

418. 

[13] P. Vallurupalli, D. F. Hansen, L. E. Kay, Proc. Natl. Acad. Sci. USA 2008, 
705, 11766. 

[14] D. M. Korzhnev, L. E. Kay, Acc. Chem. Res. 2008, 47, 442. 
[15] J. Jen, J. Magn. Reson. 1978, 30, 111. 

[16] M. Tollinger, N. R. Skrynnikov, F. A. A. Mulder, J. D. Forman-Kay, L. E. Kay, 

J. Am. Chem. Soc. 2001, 723, 11341. 
[17] F. A. A. Mulder, N. R. Skrynnikov, B. Hon, F. W. Dahlquist, L. E. Kay, J. Am. 

Chem. Soc. 2001, 723, 967. 
[18] I. R. Kleckner, M. P. Foster, Biochim. Biophys. Acta 2011, 7874, 942. 
[19] Y. Tang, W. M. Schneider, Y. Shen, S. Raman, M. Inouye, D. Baker, M.J. 

Roth, G. T. Montelione, J. Struct. Funct. Genomics 2010, 77, 223. 
[20] B. Xia, H. P. Ke, W. Jiang, M. Inouye, J. Biol. Chem. 2002, 277, 6005. 
[21] K. Newkirk, W. Q. Feng, W. N. Jiang, R. Tejero, S. D. Emerson, M. Inouye, 

G. T. Montelione, Proc. Natl. Acad. Sci. USA 1994, 97, 5114. 



[22] Z. Shajani, G. Varani, Biopolymers 2007, 86, 348. 

[23] C. S. Thakur, T. K. Dayie, J. Biomol. NMR 2012, 52, 65. 

[24] K. Lu, Y. Miyazaki, M. F. Summers, J. Biomol. NMR 2010, 46, 113. 

[25] K. Fauster, C. Kreutz, R. Micura, Angew. Chem. 2012, 724, 13257; Angew. 

Chem. Int. Ed. 2012, 57, 13080. 
[26] C. Kreutz, H. Kahlig, R. Konrat, R. Micura, Angew. Chem. 2006, 778, 3528; 

Angew. Chem. Int. Ed. 2006, 45, 3450. 
[27] C. Kreutz, H. Kahlig, R. Konrat, R. Micura, J. Am. Chem. Soc. 2005, 727, 

11558. 

[28] C. H. Wunderlich, R. Spitzer, T. Santner, K. Fauster, M. Tollinger, C. Kreutz, 
J. Am. Chem. Soc. 2012, 734, 7558. 

[29] S. Sauve, L. Tremblay, P. Lavigne, J. Mol. Biol. 2004, 342, 813. 

[30] W. Fieber, M. L. Schneider, T. Matt, B. Krautler, R. Konrat, K. Bister, J. Mol. 
Biol. 2001, 307, 1395. 

[31] D. Kurzbach, G. Platzer, T. C. Schwarz, M. A. Henen, R. Konrat, D. Hinder- 
berger, Biochemistry 2013, 52, 5167. 

[32] M. Drescher, Top. Curr. Chem. 2012, 327, 91. 

[33] A. Kavalenka, I. Urbancic, V. Belle, S. Rouger, S. Costanzo, S. Kure, A. 

Fournel, S. Longhi, B. Guigliarelli, J. Strancar, Biophys. J. 2010, 98, 1055. 
[34] N. L. Pirman, E. Milshteyn, L. Galiano, J. C. Hewlett, G. E. Fanucci, Protein 

Sci. 2011,20, 150. 

[35] J. Habchi, M. Martinho, A. Gruet, B. Guigliarelli, S. Longhi, V. Belle, in In- 
trinsically Disordered Protein Analysis, Vol. 7 (Eds.: V. N. Uversky, A. K. 
Dunker), Humana Press, 2012, pp. 361. 

[36] D. Kurzbach, D. R. Kattnig, B. Z. Zhang, A. D. Schluter, D. Hinderberger, 
Chem. Sci. 2012, 3, 2550. 

[37] M.J.N. Junk, H. W. Spiess, D. Hinderberger, Angew. Chem. 201 0, 722, 
8937; Angew. Chem. Int. Ed. 2010, 49, 8755. 

[38] Y. D. Tsvetkov, A. D. Milov, A. G. Maryasov, Russ. Chem. Rev. 2008, 77, 
487. 

[39] D. Kurzbach, D. R. Kattnig, N. Pfaffenberger, W. Schartl, D. Hinderberger, 

ChemistryOpen 2012, 7, 211. 
[40] G. Jeschke, G. Panek, A. Godt, A. Bender, H. Paulsen, Appl. Magn. Reson. 

2004, 26, 223. 

[41] G. Jeschke, Y. Polyhach, Phys. Chem. Chem. Phys. 2007, 9, 1895. 

[42] A. Schweiger, G. Jeschke, Principles of Pulse Electron Paramagnetic Reso- 
nance, Oxford University Press, New York, 2001. 

[43] G. Jeschke, A. Koch, U. Jonas, A. Godt, J. Magn. Reson. 2002, 755, 72. 

[44] A. Bellahcene, V. Castronovo, K. U. E. Ogbureke, L. W. Fisher, N. S. Fedar- 
ko, Nat. Rev. Cancer 2008, 8, 212. 

[45] K. E. v. Holde, W. C. Johnson, P. S. Ho, Principles of Physical Biochemistry 
Prentice Hall, New Jersey, 1998. 

[46] H. Ponta, L. Sherman, P. A. Herrlich, Nat. Rev. Mol. Cell Biol. 2003, 4, 33. 

[47] G. M. Clore, C Tang, J. Iwahara, Current Opinion in Structural Biology 
2007, 77, 603. 

[48] K. K. Frederick, M.S. Marlow, K. G. Valentine, A.J. Wand, Nature 2007, 
448, 325. 

[49] V. N. Uversky, Int. J. Biochem. Cell Biol. 2011, 43, 1090. 



Received: April 11, 2014 



© 2014 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim 



ChemistryOpen 201 4, 3, 1 1 5 - 1 23 123 



